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We  consider  the  solution  of  the  single  comodity  strictly  convex 
network  flow  problem  in  a  distributed  asynchronous  computation  environment. 
The  dual  of  this  problem  is  unconstrained,  differentiable,  and  well  suited 
for  solution  via  Gauss-Seidel  relaxation.  We  show  that  the  structure  of 
the  dual  allows  the  successful  application  of  a  distributed  asynchronous 
method  whereby  relaxation  iterations  are  carried  out  in  parallel  by  several 
processors  in  arbitrary  order  and  with  arbitrarily  large  interprocessor  com¬ 
munication  delays. 
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1 .  Introduction 


Consider  a  directed-  graph  with  set  of  nodes  N  and  set  of  arcs  A.  Each 
arc  (i,j)  has  associated  with  it  a  cost  function  g^:  R  -*■(-«, •*»] .  We  denote 
by  f^  the  flow  of  the  arc  (i,j)  and  consider  the  problem  of  minimizing 
total  cost  subject  to  a  conservation  of  flow  constraint  at  each  node 


minimize  £  g^(f,J 
(i.j)eA 

subject  to  l  f  .  -  I  f. .  ■  0,  y  ieN. 

(mti)eA  ml  (i,j)eA  lj 


We  make  the  following  standing  assumptions  on  g^ : 

a)  g is  strictly  convex,  lower  semicontinuous,  and  subdifferentiable 
at  each  point  within  its  effective  domain,  i.e.  has  at  least  one  sub¬ 
gradient  at  every  f^  with 

b)  The  conjugate  convex  function  of  g^,  defined  by 

*ij(V  *  ’up 

is  real  valued,  i.e.  -»  <  gJjCtjj)  <  •  for  all  real  tj^.  (Because  of  the 
strict  convexity  assumed  in  a)  above,  gf^  is  also  continuously  differentiable 
and  its  gradient  denoted  is  the  unique  f_  attaining  the  supremum 

in  (2) --see  [7],  pp.'  218.2S3J. 

It  is  easily  seen  from  (2)  that  Assumption  b)  implies  that 
lim  g. .(f. .)  *  *.  Therefore  the  objective  function  of  the  primal 

i  fi.j  1"**° 

problem  (1)  has  bounded  level  sets  ([  7],  Section  3).  It  follows  that 
there  exists  an  optimal  solution  for  problem  (1)  which  must  be  unique 
in  view  of  the  strict  convexity  assumed  in  a) . 

The  problem  above  is  of  great  practical  interest  and  has  been  studied 
for  a  long  time.  Except  for  strict  convexity  our  assumptions  are  not  overly 
restrictive.  For  example  they  are  satisfied  in  the  following  two  cases: 


1)  The  constrained  case  where  g^  is  of  the  form 


W 


I  ;tJcv 


if  f^ 
otherwise 


where  2.^  and  c^  are  given  lower  and  upper  bounds  on  the  arc  flow,  and  g^ 
is  a  strictly  convex,  real  valued  function  defined  on  the  real  line  R. 

2)  The  unconstrained  case  where  g„  is  strictly  convex,  real  valued  and 

0 

its  right  and  left  derivatives  g^  and  gT^  satisfy 


11. 

ij 


A  dual  problem  for  (1)  is  given  by 


minimize  q(p) 

subject  to  no  constraints  on  the  vector  p  ■  {p^jieN}, 


where  q  is  the  dual  functional  given  by 


q(P)  ■  I  (Pi-Pi) 

(i,j)eA  13  1  J 


We  refer  to  p  as  a  price  vector  and  its  components  p^  as  prices .  The  ith 
price  is  really  a  Lagrange  multiplier  associated  with  the  ith  conservation 
of  flow  constraint.  The  duality  between  problems  (1)  and  (5)  is  well  known 
and  is  explored  in  great  detail  in  the  recent  book  by  Rockefeller  [1].  The 
earlier  book  by  Rockefeller  [7]  gives  the  necessary  and  sufficient  condition 
for  optimality  of  a  pair  (f ,p) .  A  feasible  flow  vector  f  ■  {f^^ [ (i*j)cA}  is 
optimal  for  (1)  and  a  price  vector  p  ■  fp^JicN  is  optimal  for  (5)  if  and 


4 


only  if  for  all  arcs  (i,j)  ([7],  pp.  337-338) 


Pi»Pj  is  a  subgradient  of  g^.  at  f^ 


An  equivalent  condition  is 


£ij  "  v  Ci.5)eA. 


Any  one  of  these  equivalent  relations  is  referred  to  as  the  coapleaentai 
slackness  condition,  and  is  shown  in  Figure  1. 


Since  the  dual  problem  is  unconstrained  and  differentiable  it  is 
natural  to  consider  algorithmic  solution  by  a  descent  iterative  method. 

The  Gauss-Seidel  relaxation  method  is  particularly  interesting  in  this  respect 
since  it  admits  a  simple  implementation.  Given  a  price  vector  p» a  node  i  is 

A 

selected  and  its  price  p^  is  changed  (relaxed)  to  a  value  p^  such  that 


,  I,. . 

(m,i)eA 


l  Vg?,(Pi-p.). 
(i.j)eA  13  1  3 


It  is  easily  seen  (compare  with  the  definition  (6)  of  the  dual  cost  q)  that 
this  equation  is  equivalent  t0  fp  *  0,  so  the  dual  cost  is  minimized  at  p^ 
with  respect  to  the  ith' price,  all  other  prices  being  kept  constant.  The 
algorithm  proceeds  by  relaxing  the  prices  of  all  nodes  in  cyclic  order  and 
repeating  the  process.  The  convergence  of  this  algorithm  does  not  follow 
immediately  from  standard  results  on  relaxation  methods  [2],  [3],  [4]  since 
these  results  require  some  assumption  that  is  akin  to  strict  convexity  of 
the  dual  objective  function  (for  a  counterexample  see  Powell  [5]).  Un¬ 
fortunately  the  dual  objective  function  (6)  need  not  be  strictly  convex  and 
the  only  result  we  know  of  regarding  convergence  of  a  network  algorithm  based 
on  relaxation  is  due  to  Cottle  and  Pang  [6] .  It  applies  to  transportation 
problems  with  quadratic  cost  function,  and  involves  certain  restrictions  in 
the  way  relaxation  is  carried  out. 

Our  main  objective  in  this  paper  is  to  explore  the  convergence 
properties  of  distributed  versions  of  the  relaxation  method  just  described. 
Here  we  assume  that  each  price  p^  is  under  the  control  of  a  separate 

A 

processor  who  changes  p^  to  p^  on  the  basis  of  (8)  and  communicates  the 
new  value  to  the  other  processors.  One  can  consider  a  parallel  computation 
procedure  carried  out  in  an  orderly  manner  whereby  all  processors  exchange 


.■ /■  ^ 


their  current  prices  before  carrying  out  their  relaxation  iteration. 
Mathematically  this  would  be  equivalent  to  a  Jacobi  type  of  relaxation 
procedure.  We  would  like  to  consider,  however,  a  much  more  general  procedure 
whereby  the  comnunication  between  processors  is  not  regular,  and  the  information 
available  at  some  processors  regarding  prices  of  other  processors  may  be 
arbitrarily  out-of-date.  In  addition  we  allow  some  processors  to  iterate 
more  frequently  than  others.  Models  of  such  asynchronous  algorithms  have 
been  formulated  some  time  ago  and  by  now  there  is  considerable  tnderstanding ' 
of  their  convergence  properties  (see  [8]- [161;  [17]  is  a  survey).  It  turns 
out  that  the  dual  problem  (S)  has  structure  that  allows  us  to  show  that 
the  asynchronous  relaxation  method  has  satisfactory  convergence  properties. 

This  is  particularly  true  when  the  dual  problem  (5)  has  a  unique  optimal 
solution.  Otherwise  satisfactory  convergence  depends  on  the  starting  point. 
These  results  are  all  new  and  are  shown  in  Section  3.  The  next  section 
analyzes  the  structure  of  the  dual  solution  set  and  provides  some  preliminary 
analysis . 

Our  notational  conventions  are  that  a  subscript  denotes  a  node  or  pro¬ 
cessor  index,  and  a  superscript  denotes  a  time  or  iteration  index.  All  vector 
inequalities  should  be  interpreted  in  a  coordinatewise  sense.  In  order  to 
simplify  notation  we  have  implicitly  assumed  that  there  is  at  most  one  arc 
associated  with  any  ordered  pair  of  nodes  i  and  j ,  so  that  the  arc  notation 
(i,j)  has  unambiguous  meaning.  However  this  assumption  is  not  essential 


-7- 


2.  Structure  of  the  Optimal  Dual  Solution  Set 

Our  standing  assumptions  a)-c)  guarantee  that  the  primal  problem  (1) 
has  a  unique  optimal  solution.  They  also  guarantee  that  the  dual  problem 
(5)  has  at  least  one  optimal  solution.  (Assumption  a)  guarantees  that  every 
feasible  solution  of  the  primal  problem  is  regularly  feasible  [1]  and  the 
existence  theorem  of  [1],  p.  360  applies).  On  the  other  hand  the  optimal 
solution  of  the  dual  problem  (S)  is  never  unique  since  adding  the  same  con¬ 
stant  to  all  coordinates  of  a  price  vector  p  leaves  the  dual  cost  unaffected. 
We  can  remove  this  degree  of  freedom  by  constraining  the  price  of  one  node, 
say  node  N,  to  be  zero.  Thus  we  consider  the  reduced  dual  optimal  solution 
set  P*  defined  by 

P*  -  t'p*|  q(p*)  »  nin  q(p) ,  pj  -  0)  (9) 

P 

where  q  is  the  dual  objective  function 

q(P)  ■  I  (1°) 

For  the  most  part  of  the  paper,  we  will  operate  under  the  following 
assumption: 

Assumption  1:  Ihe  reduced  dual  optimal  solution  set  P*  is  compact. 

Asstnption  1  is  not  overly  restrictive.  For  example  let  {f*^  |(i,j)eA) 
be  the  unique  primal  optimal  solution,  and  consider  the  set  of  arcs 


A  ■  {(i»j)lfjj  lies  in  the  interior  of  the  set  {f^  |g^  (f^)**}} . 

(ID 

A 

Then  Assumption  1  is  satisfied  if  the  subgraph  "(N, A)  is  connected.  To  see 

A 

this  note  that  for  all  arcs  (i,j)eA  we  have  a  bounded  set  of  subgradients 


r#  ,  ‘ 


of  g^  at  thereby  implying  a  bounded  set  of  price  differences  P^"Pj 
corresponding  to  dual  optimal  solutions  [cf.  (6)].  Note  that  in  the  un- 

A 

constrained  case  mentioned  in  the  previous  section  every  arc  belongs  to  A 
so ,  if  the  original  graph  is  connected.  Assumption  1  is  satisfied.  The  con¬ 
strained  case  of  the  previous  section  can  be  converted  to  the  unconstrained 
case  by  replacing  constraints  by  nondifferentiable  penalty  functions  (see 
[18],  Section  5.5].  For  example  a  constraint  f^  _>  0  can  be  eliminated  by 
adding  to  the  cost  g^.  a  penalty  c  max{0,-f^.}  ♦  with  c 

positive  and  sufficiently  large. 

Consider  now  the  set 

P  *  (p|pN  *  0}  (12) 

and  for  i  ■  1,...,N-1,  the  point-to-set  mapping  Ri  which  assigns  to  a  price 

A 

vector  peP  the  set  of  all  prices  p^  that  minimize  the  dual  cost  along  the 
ith  price  starting  from  p,  i.e.  (cf.  (8)) 

Vp)  »  tPil  I  78ii(Pn'Pi)  *  ^  (13> 

It  is  well  known  that  a  real  valued  convex  function  having  one  compact  level 
set,  has  all  its  level  sets  compact  ([7],  p.  70).  Therefore  under  Assumption 
1  the  sets  R^(p),  peP  are  all  nonempty,  compact  intervals.  It  follows  that 
under  Assumption  1  the  (point-to-point)  mappings 

A 

R,  (p)  ■  a  max  p,  04) 

P^Cp) 


A 

R.  (p)  ■  a  min  p,  (15) 

PleRi(p] 


-9- 


are  well  defined  on  the  set  P.  We  call  (1L)  the  ith  maximal  (minimal) 
relaxation  napping.  It  gives  the  maximal  (minimal)  minimizing  point  of  the 
dual  cost  along  the  ith  coordinate  starting  from  its  argument.  The  point- to- 
set  mapping  R^  is  called  the  ith  relaxation  mapping. 

Some  key  facts  are  given  in  the  following  proposition. 

Proposition  1;  Let  Assumption  1  hold.  The  mappings  IT  and  are 
continuous  on  P.  They  are  also  monotone  on  P  in  the  sense  that  for  any 


p,  P’SP,  i 

*  1 , . . . ,N  we  have 

Vp) 

<  Vp’) 

if  P  <  P', 

(16) 

^(p) 

<  ^(P*) 

if  p  £  p’ . 

(17) 

Proof ;  To  show  continuity  of  R^  we  argue  by  contradiction.  Suppose 

V 

there  exists  a  convergent  price  vector  sequence  p  ■+•  p  such  that  the  cor- 

—  k  _ 

responding  sequence  {R^(p  )}  does  not  converge  to  R^(p).  By  passing  to  a 


subsequence  if  necessary  suppose  that  for  some  6  >  0  we  have 

Vp)  -  Vpk)  *  6t  V  k  (18) 

(the  proof  is  very  similar  if  6  <  0  and  the  inequality  is  reversed) .  By 
the  definition  of  FL  we  have 

I  7S;i Cp»  *  l  ^jCiypJ-Pj),  (19) 

1 7*ii0vVpk>>  •  1 7*j.(?1cplt)-p‘),  »  k.  (20 


»V  tv 


Since  p  ■*  p  it  follows  using  (IS)  that  for  sufficiently  large  k  we  have 
p£-ypk)  >  Pm-RiCp)  .  V  (a.i)eA 

RiCpk)-Pj  <  R^PJ-Pj  .  V  (ij)eA. 


Therefore  for  sufficiently  large  k  we  have  using  the  convexity  of  gV ,  g£. 


7lV(p£-Ri(pk))  >  ^(P^CP)) 

7g?j(R1(pk)-p^)  <  TiJjCRjtpJ-Pj), 


,  V  (a,i)eA, 


,  V  (i, j)eA. 


Using  these  relations  together  with  (19),  (20)  we  obtain  for  all  sufficiently 


large  k 


f«i  ■  7*rt«V*i(rt)  ‘  7^(VRi(Pk))'  7 


fij  ‘  *  Vg^f^CP^-Pj).  V  (i.j)eA.  (22) 


Consider  the  intervals  I  .  and  I.,  given  by 

m  xj 


**  ’  {tl  Ki™  ’  fai> 


,  V  (a,i)eA 


lLj  -  (t |  7g^(t)  -  ftj}  ,  V  (i.j)eA. 


For  k  sufficiently  large  so  that  (21),  (22)  hold  we  have 


Rt(p)  -  aax{pi|pi  e  Pm-Iai,  (m,i)  sA,  P^I^-P^  (i,j)eA) 


^(P*)  «  nax{pi  |p^  e  Pa"Ini»  (m,i)eA,  p£  e  I± ^ ”Pj *  (i.j)eA) 


v.V.v.v..  .v.v.v.v.v.  -  .  .-.v.v,v  v.  -v^Y.  v  v  <.  <■  »  .  • 
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k  _  k  — 

Since  p  -*>  p,  it  is  evident  from  these  relations  that  R^(p  )  -*■  R^(p)  thereby 


contradicting  (18) 


To  show  monotonicity  of  fT  we  again  argue  by  contradiction.  Suppose 
there  exist  p  and  p*  such  that  pj  >,  Pj»  V  j»l,...,N-l  but 
^(p)  >Ri(P’)*  It  follows  then  that 


P^-RjCp’)  >  Pa-^CP),  V  (m,i)eA 


RiCp'3-Pj  <  R^P)^,  v  Ci,j)eA. 


Therefore 


i  V  (n.i)eA 


PgJ.CRjCp'J-pj)  <  PgJjtRjtpj-Pj),  V  (i.j)cA 


Since  by  definition  we  have 


l  VR^tPi-RJP'))  •  $  7jj.{iC.(p’)-pp 
m  i 


l  7C(vRic'»>  •  l  ^Aw-pj) 

m  ] 


it  follows  that  equality  holds  in  (23),  (24),  i.e. 


*.1  •  ^tp;-Ri(P'»  ■  v^1(pb-r1(P)) 


ftj  -  7*ijCR1(P')-Pp  *  ^*1  j CRt tp) -P j J 


. . 


Consider  the  intervals 


Iij  ■ 
and  let 


W 


J 


5  -  RjCp)  -  ^Cp’). 

We  have  for  all  (m,i)eA 

K  -  '.i-  p.  -  ri'p)6  J»i 

and  since  p  <  p’  we  obtain 
m  “  in 


PB  -  *itP)  <  PB  *  -  S  <  PB  -  Ri(p')* 


Therefore 

* 

Pi  ’  RiCpn  -  5elBi,  V  (m,i)eA 

and  similarly 

R^Cp')  ♦  6  -  Pj  e  liy  V  (i.j)eA. 

It  follows  that 

^(p’)  ♦  SeR^’) 

thereby  contradicting  the  maximal  nature  of  [cf.  (14)]. 

The  proof  of  continuity  and  monotonicity  of  R.  is  analogous  with  the 


Q.E.D. 


one  just  given  for  and  is  omitted. 

The  mono tonicity  and  continuity  of  the  aappings  R,  and  R.  imply 

a  thus  far  unreported  and  somewhat  surprising  property  of  the  optimal 
dual  solution  set. 

Proposition  2;  Let  Assumption  1  hold.  7here  exist  a  maximal  and  a  minimal 
optimal  solution  of  the  dual  problem,  i.e.  there  exist  peP*  and  £eP*  such 
that 

£  <  P  <  P»  v  PeP*.  (27) 

Proof:  Since  P*  is  nonempty  and  compact  it  contains  a  noninferior  element 
p  for  which  there  is  no  vector  peP*  such  that  p  i  p  and  p.  >  pT,  for  all 
i.  From  the  definition  of  and  the  optimality  of  p  we  have  p^  5  l.(p)  for 

all  i.  Furthermore  for  all  i  the  vector  (pj, . . .  R^p) ,  Pi+1»****PN) 

belongs  to  P*  so  from  noninferiority  of  p  it  follows  that  R. (p)  <_ p. . 

Therefore  we  have  p^  •  R^(p)  for  all  i.  Let  now  p  be  a  price  vector 
obtained  from  p  according  to 

IPi  *  <$  ,  i  ■  1 , . . .  ,N-1 

0  i  -  N  , 

where  5  >  0  is  sufficiently  large  so  that 


It  is  easily  seen  that  we  have  R-^p)  £  pi§  for  all  i  so,  using  the 
nonotonicity  of  Rj  shown  in  Proposition  1 ,  we  obtain 

—  —  k+l  "  *■ 

P  <  R  (P)  <  Rip)  ,  V  k  (29) 

—  1  v_i 

where  R:  R*  -*■  R*  is  the  mapping 

R(P)  *  CR1Cp).-..,Ri,.1Cp)3  (30) 

.A  _ 

and  R  is  the  composition  of  R  with  itself  k  times.  From  (29)  we  see  that 
the  sequence  1  (p)  converges  to  some  p  and  by  continuity  of  R  we  must  have 

A  A  A  A  A  <*, 

P  *  R(p)  as  well  as  p  >  p.  Since  p  ■  R(p)  implies  that  peP*  it  follows 

A 

from  the  choice  of  p  that  p  •  p .  Also  from  (28) ,  (29)  and  the  fact 

A 

P  £  R(P)  for  all  peP*  we  obtain  p  »  p  >  p  for  all  pep*  which  shows  that 
p  is  a  maximal  element  of  P*.  The  proof  for  existence  of  a  minimal 

element  £  is  entirely  similar.  Q.E.D. 
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5.  Convergence  Analysis  of  Asynchronous  Relaxation 

The  model  of  distributed  asynchronous  computation  we  adopt  is  described 
in  [11],  [12].  With  each  node  i  ■  1,...,N-1  we  associate  a  processor  that 


computes  from  time  to  time  some  element  of  R^Cp)  [here  p  is  the  latest 
price  vector  available  to  processor  i),  and  sets  the  price  p^  to  this 
element.  This  price  is  then  communicated  at  some  later  time  to  all  other 


processors.  Computation  and  communication  at  the  various 


processors  need  not  be  synchronized.  The  precise  model  is  as  follows: 


At  each  time  instant,  node  i  can  be  in  one  of  three  possible  states 
compute,  transmit,  or  idle.  In  the  compute  state  node  i  computes  a  new 
price  p^.  In  the  transmit  state  node  i  communicates  the  price  p^  obtained 
from  its  own  latest  computation  to  one  or  more  nodes  m(mi<i) .  In  the  idle 


state  node  i  does  nothing  related  to  the  solution  of  the  problem. 

We  assume  that  computation  and  transmission  for  each  node  takes  place 
in  time  intervals  t^tj]  with  tj  <  t2,  but  do  not  exclude  the 
possibility  that  a  node  may  be  simultaneously  transmitting  to  more  than  one 
node  nor  do  we  assume  that  the  transmission  intervals  to  these  nodes  have 


the  same  origin  and/or  termination.  We  also  make  no  assumptions  on  the 
length,  timing  and  sequencing  of  computation  and  transmission  intervals 


other  than  the  following: 


Assumption  2:  For  every  node  i  and  time  t  >  0  there  exists  a  time  t'  >  t 

such  that  [t,t'J  contains  at  least  one  computation  interval  for  i  and 

at  least  one  transmission  interval  from  i  to  each  node  m  such  that  (m,i)eA 
or  (i,m)eA. 

Assumption  2  is  very  natural.  It  states  in  essence  that  no  node 
"drops  out  of  the  algorithm"  permanently— perhaps  due  to  a  hardware  failure. 
Without  this  assumption  there  is  hardly  anything  we  can  hope  to  prove. 
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Each  nod©  i  has  a  buffer  B^  for  each  mj*i  where  it  stores  the 

latest  transmission  from  m,  as  well  as  a  buffer  B^  where  it  stores  its 

own  price  estimate  p^.  The  contents  for  each  buffer  at  time  t  are 

denoted  ptCi)«  Thus  p*(i)  !*•  for  every  t,  i  and  m  an  estimate  of  the 
n  b& 

price  p  available  at  node  i  at  time  t.  It  is  important  to  realize  in 

m 

what  follows  that  the  buffer  contents  p*(i),  and  p*(i')  at  two  different 

Bl  Ml _ 

nodes  i  and  i*  need  not  coincide  at  all  times.  If  ijha  and  i1^  m  the 
buffer  contents  pt(i),  and  p*(i')  need  not  coincide  at  any  time  t.  The 
vector  of  all  buffer  contents  of  node  i  is  denoted  pt(i)>  i.e.. 


» V ’> 


p*(i)  *  (p*(i)  I*  •  1.....N-1}. 


The  rules  according  to  which  the  buffer  contents  pB(i)  are  updated  are 
as  follows: 

(1)  If  [tj^tj]  is  a  transmission  interval  from  node  m  to  node  i,  the 

contents  of  the  buffer  B  at  time  t.  are  transmitted  and  entered  in  the 

mm  x 

buffer  B^  at  time  t2>  i.e. 


t-  t  j 

VCD  -  p.  w. 


(31) 


(2)  If  [t. ,t,]  is  a  computation  interval  for  node  i,  the  content  of 

*1 

the  buffer  B^  is  replaced  at  time  t2  with  an  element  of  Rj(p  (i)),  i.e. 


*2  *1 
Pi  (i)  e  RjCP  (D) 


(32) 


(3)  The  contents  of  a  buffer  B^  can  change  only  at  the  end  of  a 
computation  interval  for  node  i.  The  contents  of  a  buffer  V  iyta  can 
change  only  at  the  end  of  a  transmission  interval  from  m  to  i. 


v  v  w.v  *i*.  a  tvwvpjjp  ■  ■■>  ■■*.  j  ■■ .  i  -n  ■■■.! . .  ^  ^  „ 

$  .  ^ 

& 


K 

1 


$ 


& 
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The  algorithm  based  on  (32)  will  be  called  Asynchronous  Relaxation 
Method  (ARM) . 

Our  objective  is  to  derive  conditions  under  which  limit  points  of 
the  sequences  (pt(i)}  are  optimal  solutions  of  the  dual  problem  (5).  The 
following  proposition  is  our  main  result.  The  proof  is  based  on  a  gen¬ 
eral  convergence  theorem  given  in  [12]  (see  also  [17])  and  applicable  to 
asynchronous  iterative  algorithms  such  as  the  one  just  described.  The  key 


Ks" 

KV 

IV 


property  that  makes  asynchronous  convergence  possible  is  the  monotonicity 
of  the  mappings  IT  and  shown  in  Proposition  1.  This  property  is  also 
present  in  dynamic  programing  models  and  has  been  similarly  exploited 
to  show  the  validity  of  asynchronous  versions  of  the  successive  approximation 
method  [11]. 


Proposition  3:  Let  Assumptions  1  and  2  hold.  For  any  initial  buffer 
contents  p°(i)  eP,  i  »  1,...,N-1,  each  limit  point  of  the  sequences 
(pt(i)}  generated  by  the  ARM  belongs  to  the  set 

P  •  tp  i  £  ;<  P  i  P^  (33) 


where  p  and  £  are  the  maximal  and  minimal  dual  optimal  solutions.  In 
particular,  if  the  reduced  dual  optimal  solution  set  P*  consists  of  a 
unique  vector  p*  we  have 

lim  p*(i)  *  p*  ,  i  ■  1,...,N-1.  (34) 

tr**> 

Proof:  Let  £,  ji  e  P  be  price  vectors  such  that 
g  <  P°(i)  <  P  Vi-  1.....N-1 

and  such  that 
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p  <  R(p)  <  £  <  p  <  1T(P)  <  p. 


lia  Rk(p)  ■  p  ,  lia  R^Cp)  *  P- 
k—  ”  *  k— 


(The  existence  of  such  vectors  was  established  in  the  proof  of  Proposition 
2).  Consider  the  sets 


P*  -  {p  j  Rk(p)  1  P  i  **(p)}  ,  k  *  1.2, 


(35) 


r«kl  j . 


Note  that  the  sequence  {P  }  is  nested  and  that  the  conmon  intersection  of 
the  sequence  is  the  set  P  of  (33) . 

We  will  apply  now  a  convergence  theorem  given  in  Section  3  of  [12] 
(arProp.  3.1  of  [17])..  According  to  this  theorem  the  desired  result 
will  be  proved  if  the  following  three  conditions  are  satisfied.  (Rather 
than  consulting  the  references  just  cited,  the  reader  may  wish  to  think 


through  the  proof  of  this  since  it  is  rather  simple). 


a)  If  peP  then  for  every  i  the  vector  p'  with  coordinates 


P 

'  R* 


(P) 


if  j  i  i 
if  j  -  i 


(cf.  the  equation  (32)  associated  with  computation  at  node  i),  also  belongs 


to  Pk. 


b)  If  peP*  and  pep*  then,  for  every  i  and  m,  the  vector  p '  with  coordinates 


pj  ■ 


if  j  f  a 
if  j  ■  m 


(cf .  equation  (31)  associated  with  transmission  from  node  m  to  node  i) , 

_lr 

also  belongs  to  P  . 

c)  If  p(l) , . . . ,p(N-l)  belong  to  P  then  the  vector  p’  with  coordinates 


Pj  »  RjCpCj)) 


»  j  ,N-1 


PN  * 


(cf.  a  computation  (32)  at  each  node  followed  by  a  transmission  to  every 

^c+1 

other  node),  belongs  to  P 

It  is  easily  seen  that  all  the  conditions  stated  above  are  satisfied 


in  our  case  so  the  desired  conclusion  follows. 


Q.E.D. 


Proposition  3  shows  that  the  ARM  has  satisfactory  convergence  when 
P*  has  a  unique  element.  One  way  to  guarantee  this  is  to  consider  the 
optimal  solution  f*  of  the  primal  problem  (1)  and  the  set  of  arcs 


A  ■  {(i,j)eA  l  g^  is  differentiable  at  fjj). 


Them  if  the  graph  (N,A)  is  connected>P*  consists  of  a  unique  point  in 

« 

view  of  the  complementary  slackness  condition  (7).  In  order  to  improve 
the  convergence  properties  when  P*  has  more  than  one  point  it  is  necessary 
to  modify  the  ARM  so  that  a  computation  at  node  i  replaces  p^  with  IT  (p) 
(not  just  any  element  of  R^(p)).  We  call  this  the  maximal  ARM.  If  in 
place  of  R^(p)  we  use  R^(p)  the  resulting  method  is  called  the  minimal  ARM. 

Proposition  4:  Let  Assumptions  1  and  2  hold.  Assume  that  the  starting 
buffer  contents  satisfy 


Wv 
1  •  •  •  * 


P  Ci)  >  P  . 


Vi-  1.....N-1 


Then  if  {pt(i)i  is  generated  by  the  maximal  ARM  we  have 


lia  p  (i)  -  p  ,  i  -  1, . . . ,N-1. 

t-*« 


Proof:  The  proof  is  identical  to  the  one  of  Proposition  3  except  that  the 
set  P  of  (35)  should  be  replaced  by 


P*  -  <p|  P  <  P  <  Stp)>. 


Q.E.D. 


There  is  a  similar  result  for  the  minimal  ARM  whereby  p  is  replaced 
by  £  and  condition  (36)  is  replaced  by  p°(i)  <_  £  for  all  i.  The  follow¬ 

ing  example  demonstrates  that  the  results  of  Proposition  3  and  4  cannot 
be  improved . 

Example:  Consider  the  3-node  network  shown  in  Figure  2. 


Figure  2 


The  arc  costs  are 


g12(£125  *  (f12)2,  g23  "  lf23l  +  (f235  *  g31(f315  “  ^ f 31 ^  +  (f31 


and  the  optimal  primal  solution  is 


The  reduced  dual  optimal  solution  set  is  derived  from  condition  (“)  and  is 
given  by 

?*  •  (pj  p3  »  0,  Pj  ■  p2,  -1  <  Pj  <  1>  -1  <  P2  i  !>• 

The  results  of  Proposition  3  and  4  are  illustrated  in  Figure  3.  To  see  that 
the  ARM  as  well  as  the  maximal  and  minimal  ARM  may  not  converge  to  a  dual 
optimal  solution,  let  the  buffer  contents  of  processors  1  and  2  be  both 
equal  to  (-1,1)  and  let  both  processors  update  the  respective  price  co¬ 
ordinates  and  then  exchange  the  results  of  the  computation.  Then  the  buffer 
contents  will  be  (1,-1),  and  by  repeating  this  process  one  more  time  the 
buffer  contents  will  become  again  (-1,1)  thereby  completing  a  cycle.  There- 
fore  in  general  we  cannot  expect  convergence  of  the  ARM  to  the  optimal 
solution  set  if  the  latter  contains  more  than  one  element.  Similarly 
the  maximal  and  minimal  ARM  need  not  converge  to  p  and  £  respectively 
if  the  initial  buffer  contents  do  not  belong  to  the  appropriate  regions 
[cf.  (36)]. 


i 


Set  of  initial 
buffer  contents 
for  which  the 
maximal  ARM 
converges  to  $ 


Maximal 
Solution  p 


Set  of  initial 
buffer  contents^1 
for  which  the  A1 
minimal  ARM  N/J 
converges  to  p  ^ 


Set  I  to  which  the  /  \ 
ARM  converges'  A 


Minimal 
Solution  p 


Reduced  Dual 
Optimal  Solution 
S*t  P* 


Figure  3:  Structure  of  the  optimal  solution  set,  and  convergence 

regions  of  the  ARM,  the  maximal  ARM,  and  the  minimal  ARM. 


»  . 
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